Scene Text Area Detection from Video
نویسندگان
چکیده
Text detection from videos is a well known research area. Especially the detection of static superimposed text such as captions has been researched successfully, but makes many assumptions that question the applicability of those algorithms for moving scene text. In this dissertation, I propose a scene text area detection approach that includes a simple key frame extraction, feature extraction, feature code generation and text area classification. Common edge and variance based features of scene text areas are evaluated and comprised in a "combined feature scheme". For the detection of text areas, two classifiers using a self-organising map and a feed-forward neural network are compared. Ground truth video data with different characteristics is used to compare the neural computing methods. A combination of detection performance measures and changing features shows the applicability of edge and variance based features and leads to the proposal of improvements of the "combined feature scheme". Car license plates serve as sample text areas in this research.
منابع مشابه
Compressed Domain Scene Change Detection Based on Transform Units Distribution in High Efficiency Video Coding Standard
Scene change detection plays an important role in a number of video applications, including video indexing, searching, browsing, semantic features extraction, and, in general, pre-processing and post-processing operations. Several scene change detection methods have been proposed in different coding standards. Most of them use fixed thresholds for the similarity metrics to determine if there wa...
متن کاملAccurate video text detection through classification of low and high contrast images
Detection of both scene text and graphic text in video images is gaining popularity in the area of information retrieval for efficient indexing and understanding the video. In this paper, we explore a new idea of classifying low contrast and high contrast video images in order to detect accurate boundary of the text lines in video images. In this work, high contrast refers to sharpness while lo...
متن کاملTraffic Scene Analysis using Hierarchical Sparse Topical Coding
Analyzing motion patterns in traffic videos can be exploited directly to generate high-level descriptions of the video contents. Such descriptions may further be employed in different traffic applications such as traffic phase detection and abnormal event detection. One of the most recent and successful unsupervised methods for complex traffic scene analysis is based on topic models. In this pa...
متن کاملNF-SAVO: Neuro-Fuzzy system for Arabic Video OCR
In this paper we propose a robust approach for text extraction and recognition from video clips which is called Neuro-Fuzzy system for Arabic Video OCR. In Arabic video text recognition, a number of noise components provide the text relatively more complicated to separate from the background. Further, the characters can be moving or presented in a diversity of colors, sizes and fonts that are n...
متن کاملMulti-oriented text detection and verification in video frames and scene images
In this paper, we bring forth a novel approach of video text detection using Fourier-Laplacian filtering in the frequency domain that includes a verification technique using Hidden Markov Model (HMM). The proposed approach deals with the text region appearing not only in horizontal or vertical directions, but also in any other oblique or curved orientation in the image. Until now only a few met...
متن کامل